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ABSTRACT 

A quantum algorithm is exact if, on any input data, it out- 
puts the correct answer with certainty (probability 1). A 
key question is: how big is the advantage of exact quantum 
algorithms over their classical counterparts: deterministic 
algorithms. For total Boolean functions in the query model, 
the biggest known gap was just a factor of 2: PARITY of 
TV inputs bits requires TV queries classically but can be com- 
puted with [TV/2] queries by an exact quantum algorithm. 

We present the first example of a Boolean function f(xi, ...,xn) 
for which exact quantum algorithms have superlinear advan- 
tage over the deterministic algorithms. Any deterministic al- 
gorithm that computes our function must use TV queries but 
an exact quantum algorithm can compute it with o(TV 8675 ■ ) 
queries. A modification of our function gives a similar re- 
sult for communication complexity: there is a function / 
which can be computed by an exact quantum protocol that 
communicates O(TV ' 8675 '" ) quantum bits but requires fi(TV) 
bits of communication for classical protocols. 

1. INTRODUCTION 

Quantum algorithms can be either studied in the bounded- 
error setting (the algorithm must output the correct answer 
with probability at least 2/3, for every input) or in the exact 
setting ( the algorithm must output the correct answer with 
certainty, for every input). For the bounded-error case, there 
are many quantum algorithms that are better than classical 
algorithms ( [3UH16I fST 115] and many others). 

It is much more difficult to come up with exact quantum al- 
gorithms that outperform classical algorithms. The require- 
ment that the algorithm's answer must always be correct 
is very constraining: it means that, in the algorithm's final 
state, we cannot have even very small non-zero amplitudes 
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for the basis states that correspond to an incorrect answer. 
Arranging the algorithm's transformations so that this re- 
quirement is satisfied for all of the possible inputs has been 
a very challenging problem. 

We consider computing Boolean functions in the query model. 
Let Qe(/) (Q2(f)) be the smallest number of queries in an 
exact (bounded-error) quantum algorithm that computes / 
and D(f) be the smallest number of queries in a determin- 
istic algorithm that computes /. For total Boolean func- 
tions, the biggest gap between Qe(/) and D(f) has been 
achieved for the PARITY of TV input bits. A modification 
of Deutsch's algorithm [13] discovered by Cleve et al. [11] 
can compute PARITY of 2 input bits exactly with just 1 
quantum query. This immediately implies that PARITY of 
TV bits can be computed with [A/2] queries. In contrast, 
deterministic algorithms need TV queries to compute PAR- 
ITY. 

Bigger speedups are known for partial functions. For ex- 
ample, Simon's algorithm [31] can be made exact [6]. This 
gives a partial function f(xi, . . . , xn) for which Qe(/) = 
O(logTV) and D(f) = fi(-//V). The value of this function 
f(xi, . . . ,xn) is, however, defined only for very small frac- 
tion of all inputs (xi, . . . , xn). 

Many attempts have been made to come up with exact quan- 
tum algorithms for total functions but the best results have 
been algorithms that achieve the same separation as for the 
PARITY function: Q E (f) = TV/2 vs. D(f) = TV (either by 
using the parity algorithm by a subroutine (e.g. or by 

different methods 1221). 



In this paper, we give the first separation between Qb(/) 
and D(f) that is more than a factor of 2. Namely, we obtain 
QeU) = O(D(f) 0S675 -) for a sequence of functions f (with 
£>(/) -►<»). 



The sequence of functions is as follows, 
function NE{x\ , x%, Xs) defined by 



We start with the 



• NE(xi,X2,xz) = 1 if Xi 7^ Xj for some i,j £ {1, 2, 3}; 

• N E(x\,X2,xz) = if x\ — X2 — £3. 

We define NE°(xi) = xi and 



NE d (x 1 ,...,x 3d ) = NE{NE d -^{x x ,...,x %d -i), 



NE d 1 {x 3d -i +1 , . . . ,x 2 . 3 d-i), NE d 1 (x 2 . 3 d-i +1 , . . . ,x 3 d)) 

for d > 0. This sequence of functions has been known as 
a candidate for a superlinear separation between D(f) and 
Qe(/) for a long time (since at least [IQ]). The reason for 
that is the relationship between and the polynomial 

degree of / [4]. 

Let deg(f) be the degree of the unique multilinear poly- 
nomial that is equal to f(xi, . . . , xjsr). Then [4], D(f) > 
Qmtf) > deg(f)/2. If we also have D(f) = deg{f) (which 
is true for many functions /), this implies that Qe{J) > 
D{f)/2. 

To obtain a bigger gap between D(f) and Qe{/), we should 
start with / which has D(f) > deg(f). NE d has this prop- 
erty. We have D(NE) = 3 and deg(NE) = 2 which implies 
D(NE d ) = 3 d and deg(NE d ) = 2 d [24]. (There are other 
constructions of functions with D(f) > deg(f), by taking a 
different basis function / instead of NE and iterating it in 
the same way [201 \T\.) 

Buhrman and de Wolf [10] observed that this means the fol- 
lowing. If we determine Qe(f), we will either get Qe(J) = 
o(D(f)) or deg(f) — o(Qe(/)) (showing that the degree 
lower bound on Q E (f) is n °t tight). The second of those 
results was obtained in [I] which showed that QE(NE d ) > 
Q 2 (NE d ) = fi(2.121... d )". For bounded error algorithms, the 
work on negative adversary bound [18] and span programs 
[291 1271 |2"8] resulted in the conclusion that this bound is 
optimal: Q 2 (NE d ) = 9(2.121. .. d ). 

Even though the function NE d is quite well known, no pos- 
itive results on its exact complexity have been obtained. In 
this paper, we provide the first nontrivial exact quantum al- 
gorithm for NE d , showing that Q E {NE d ) = 0(2.593... d ) = 
O{D(NE d ) - S675 -). 

Main ideas. The main ideas behind our algorithm are as 
follows. We create an algorithm in which one basis state 
has amplitude 1 if NE d (xi, . . . , x 3 d) = and an ampli- 
tude a < 1 if NE d (xi, . . . ,x 3 d) = 1. The algorithm is 
constructed so that a is the same for all (xi, . . . ,x 3 d) with 
NE (xi, . . . , x 3 d) = 1. The construction is by induction: 
we use the algorithm of this type for NE d ~ 1 as a subroutine 
to construct the algorithm for NE d . 

Each such induction step decreases the difference between 
the NE d = and NE d = 1 cases, bringing a closer to 1. 
To compensate for that, we interleave the induction steps 
with a form of quantum amplitude amplification X which 
increases the difference between a and 1. At the end, we 
perform the amplitude amplification again, to construct an 
algorithm that perfectly distinguishes between the two cases. 

Communication complexity. Besides query complexity, 
our result also applies to the setting of communication com- 
plexity (in the standard two-party communication model). 
Then, we get a total function f(yi, . . . , zjv) which can be 
computed by an exact quantum protocol that communi- 
cates O(N - ms - ■■ log TV) quantum bits but requires fl(N) bits 
for classical protocols in a variety of models (deterministic, 
bounded-error probabilistic and nondeterministic protocol) 



Previously, it was known that a classical protocol that com- 
municates k bits can be converted into a quantum proto- 
col that uses shared entanglement and k/2 quantum bits of 
communication (via quantum teleportation 0). However, 
no provable gap of any size between exact quantum and 
deterministic communication complexity was known for a 
total function in the case when the quantum protocol does 
not have shared entanglement. (A communication complex- 
ity version of the Deutsch-Jozsa problem [141 [8] gives an 
exponential gap for a partial function.) 

2. DEFINITIONS 

We assume familiarity with the standard notions of quantum 
states and transformations [23]. We now briefly define the 
quantum query model, to synchronize the notation with the 
reader. 

We assume that the task is to compute a Boolean function 
f(xi, . . . ,xn), Xi, . . . ,xn £ {0, 1}. We consider a Hilbert 
space "H with basis states \i,j) for i G {0,1,..., TV} and 
j G {1,...,M} (where M can be chosen arbitrarily). We 
define that a query Q is the following transformation: 

• Q\0,j) = |0,i); 

• Q\hi) = (-i) x %j) for ie {1,2,..., N}. 

A quantum query algorithm A consists of a sequence of 
transformations Uq, Q, Ui, . . ., Uk-i, Q, Uk where Q are 
queries and Ui are arbitrary unitary transformations that 
do not depend on xi, . . . , xn ■ The algorithm starts with a 
fixed starting state \4>start) and performs the transforma- 
tions. This leads to the final state 

|VW) = UkQUk-l . • • UlQU \i>start)- 

We then measure this state and interpret the result as a bi- 
nary value y 6 {0, 1}. (That is, we define some of possible 
results as corresponding to y = and others as correspond- 
ing to y = 1.) 

An algorithm A computes f(xi, . . . , xn) exactly if, for every 
xi, . . . , xn G {0, 1}, the obtained value y is always equal to 
f(xi, . . .,X N ). 

3. RESULTS AND PROOFS 
3.1 Results 

Our main result is 



Theorem 1. 

Q E {NE d ) = 0(2.593... d ). 

Since D(NE d ) = 3 d , this means that 

Q E {NE d ) = 0(D(NE d ) - 8675 -), 

giving a polynomial separation between D(f) and Q E (f). 
One can also show that R,2(NE d ), the probabilistic query 
complexity of NE d , is equal 3 d , as well. Therefore, we also 
get the same separation between D(f) and i?2(,f). 



3.2 Framework 

In this subsection, we develop the tools for proving Theorem 
[TJ One of our main ideas is to construct algorithms which 
produce a final state in which one amplitude takes one of 
two values - depending on f(xi, . . . ,Xn). 

Definition 1. Let p 6 [—1,1]. A quantum query algo- 
rithm A p-computes a function f(xi, . . . , xn) if: 

(a) A\tl>start) = \^start) whenever f(xi, . . . , x N ) = 0; 

(b) if f(xi, . . . , xn) = 1, then 

A\lpstart) = p\lpstart) + yl — P 2 \lp) 

for some \ip) : \ip) _L \ip start) (where \ip) may depend 
on xi, ... , x n ); 

We note that p-computing a function becomes easier when 
p increases. The easiest case is p — 1 when any function can 
be 1-computed in a trivial way by performing the identity 
transformation I on \tp start)- For p = 0, an algorithm A that 
0-computes / is also an exact algorithm for / in the usual 
sense because we can measure whether the final state of A 
is \ipstart) is \i>start) or orthogonal to \%l> 3 tart) and output 
in the first case and 1 in the second case. 

We also have 



Lemma 1. If an algorithm A p-computes f with k queries, 
there is an algorithm A 1 that p-computes f with k queries, 
for any p > p. 

Proof: We enlarge the state space of the algorithm by adding 
a new basis state |0, j) which is left unchanged by queries 
Q. We extend all Ui to the enlarged state space by defining 
Ui\0,j) ~ \0,j) and change the starting state to \ip' s tart) — 
cos a\ipstart) + sina|0, j). 

If / = 0, we have .Avatar*) = \ipstart) and, hence, A\tp' start ) = 

Wstart)- 

If / = 1, then Al^t™*) =p'\ip'start) + VT= ~WTW) (W) -L 
Wstart)) for p' = {4>'start\A\ip' s tart) ■ We have 

/ 2 - 2 

p = pcos a + sin a. 

By varying a over the interval [0, ^], we can achieve any 
value of p' between p' — p (for a = 0) and p' = 1 (for 

« = !).□ 

If we have an algorithm A that p-computes NE 1-1 , we 
can use it to build an algorithm A' that p'-computes NE l , 
through the following lemma. 

Lemma 2. // an algorithm A p-computes A r _B d_1 with k 
queries, there is an algorithm A' that p' -computes NE d with 
2k queries, for p' = 1 — 4 ^ 1 ~ p - ) . 



Proof: We assume that the algorithm A consists of trans- 
formations Uo, Q, Ui, . . ., Uk-i, Q, Uk, in a Hilbert space "H 
with basis states \i,j), i 6 {0, 1,.. .,3 d " 1 }, j £ {1, . . . ,M}. 
Let \%j) s tart) be the starting state of A. 

We consider a Hilbert space W with basis states \i,j), i £ 
{0, 1, • • • , 3 d }, j € {1, . . . , 3M}. In this Hilbert space, we can 
compute NE d ~ 1 (xi, . . . ,x 3 d-i), NE d ~ 1 (x 3d -i + 1 , . . . ,x 2 . 3 d-i), 
NE d ~ 1 (x 2 . 3 d-i +1 , . . . ,x 3 d) in parallel, in the following way. 

Let Hi (I £ {1,2,3}) be the subspace spanned by the ba- 
sis states \0,j), j £ {(I - 1)M + 1, . . AM} and \i,j), i £ 
{(/ - l)3 d - a + l,...^- 1 }, j £ {1,...,M}. We have an 
isomorphism Vj : Hi — > H defined by 

Vi|0,(/-l)Af + i) = |0,j), 

V l \(l-l)3 d - 1 +i,j) = \i,j). 

We define uf } = {V^U.Vi and \^ art ) = {Vtf\i> start). 
Then, the sequence of transformations 

with the starting state \ipst ar t) is a quantum algorithm p- 
computing the function A^i5 d_1 (a;( ; _ 1 - ) 3 < i-i + 1 , . . . , x l . 3 d-i). 

Let U'i be a transformation which is equal to on Hi, 
for each I £ {1,2,3}. Then, the sequence of transforma- 
tions Uq, Q, U[, . . ., t/£_i, Q, U' k can be used to p-compute 
any of NE d -\(l - l)x 3d -i + 1 , . . . ,x,. 3 d-i), for I £ {1, 2, 3}, 
depending on the starting state. Let 

V = U' k QU k _ 1 ...,U[QK 
denote the product of these transformations. 

Let T be the unitary transformation defined by: 

• T\i,' start ) = W start ) for \i>' start ) = e? =1 ^lCf); 

• T\ip) = —\ip) for any \ifj) that is a linear combination of 
IV'itLt), \4>itlrt)> \i>ftart) and satisfies \tfj) _L \ip' start ); 

• T\ip) = \ip) for any \tp) that is perpendicular to all of 

l^start)' ^startli l^start)- 

We take the algorithm A' which performs the transformation 
A' = V~ 1 TV, on the starting state \ipstart)- We claim that 
this algorithm p'-computes the function NE d . 

To prove it, we consider two cases. 

Case 1: NE d {x 1 , x 3d ) = 0. 

This means that we have one of two subcases. 

Case la: NE d ~ 1 (x (l _ 1 y 3d -i +1 , . . . , x h3 d-i) = for all I £ 
{1,2,3}. 

Then, given a starting state \ip^tart)t V performs the algo- 
rithm for p-computing 7V_E d_1 (j:(;_ 1 j3 < i-i +1 , . . . ,^ ; . 3 d-i) = 

0. This means that V\^ art ) = \^ art ) for all I £ {1,2,3} 



and, hence, V\ip' start ) = \ip' sta rt} ■ Since T\lp' start ) = \ip star t) 
(by the definition of T), we get that A\ip' start ) = Wstart)- 

Case lb: iV_B d_1 ((/ - l)aJ 3 d-i +1 , . . . , as,. 3 4-i) = 1 for all 
I 6 {1,2,3}. 

Then, V\i>i l l rt ) = p\^Lt) + yfi^W^) where |V>»> G 

Hi and |V (0 ) -1 IVSLt)- Trivially, we also have _L 

tyitlt) ^ I + I' (since G \^2rt) G «l' and 

_L Hi')- This means that 

Applying T to this state does not change it because the first 
component is equal to p\ip' s tart) an d the second component 
is orthogonal to all iV'itort)- Hence, 

V- l TV\il>' start ) = V- l VW start ) = \i>' 8tart ). 



Case 2: NE d {x 1 , . . . , x 3d ) = 1. 

In this case, we have to prove that (ipstart\V~ 1 TV\ip' start ) = 
V ■ 

We express V\ip' atart ) = a\ip+) + P\ip-) where T\tp + ) = \ip+) 
and r|V>_> = -\i>-) and a, /3 satisfy \a\ 2 + \/3\ 2 = 1. Then, 

To calculate /3, we consider two subcases: 

Case 2a: NE 1 (xn_ 1 \ 3 d-i +1 , . . . , x t . 3 d-i) = 1 for exactly 
one of I e {1,2,3}. 

Without loss of generality, we assume that this is / = 1. 
Then, 

VW start ) = -j= (MIL) + + IV&L) + I At>) 



We have 



2= (2p-2f +2 (1-P) 2 _ 2(1 -p) 2 



27 



27 
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Case 2b: 7V_E d 1 (x ( - ; _ 1 )3d-i + 1 x t , 3 d-i ) = 1 for exactly 
two of I G {1,2,3}. 

Without loss of generality, we assume that these are I = 1 
and I = 2. Then, 

V|Vw*> = ±= (p\i> { Zrt) + V^P~ 2 \^ l) ) 

+ P \i>fL t ) + \^i* (2) ) + l^?L>) • 

We have 



Applying Lemma [2] results in an algorithm with p' > p. 
Applying it several times degrades p even further (that is, 
p becomes closer and closer to 1). To compensate for that, 
we have the following lemma for improving p (by adapting 
quantum amplitude amplification [7] to our setting). 

Lemma 3. If an algorithm A p-computes a function f 
with k queries, for p = cos a, there is an algorithm, A' that 
p' -computes f with ck queries, for p' — cos ca. 

Proof: Let \ipstart) be the starting state of A. Let T be the 
transformation defined by T\tp s t ar t) = \ipstart) and T\ip) = 
— 1^> for all |-0> : _L 

The new algorithm A' has the same starting state \tp s tart) 
and consists of transformations Vi , T, Vi, T, ■ ■ ■ , T, V c where 
Vi = A for odd i and Vi = .4" 1 for even i. Since A and A' 1 
both use k queries and T can be performed with no queries, 
the new algorithm A' uses ck queries. 

If f(xi, . . . , xn) = 0, then (by definition [j} , A\ip s tart) = 
l^stort)- Since this also means A~ 1 \ipstart) = \ipstart), we 

get that A'\lpstart) = |VWrt). 

In the f(xi,. . . ,x N ) = 1 case, we have A\ip s tart) = p\ipstart) + 
yl — p' 2 \ipi) for some _L \ipstart). We can also express 

A~ 1 TA\l/j a tart) = COS P\lp sta rt) +Sm/3\lp2) 

for some \tp2) : |^2) J- \tpstart) and (3 G [0,7r]. (We will show 
that ft = 2a.) We define 

\lfstart) = COS Ct\lp start) +sina|l/>l), 

\(fi2) = sinalV'start) - cosa|V>i). 
Then, \(p s t ar t) -L |y>2). 

Claim 1. .4 maps \ip s tart) and \ip2) as follows: 

A\lpstart) — \ f start), 

A\*p 2 ) = Ivs). 

Proof: The first equality is immediate. For the second 
equality, we consider the three vectors 

A\lpstart) — \fstart), 
TA\lpstart) = COS a\tp start) ~ Sm«|?/'l), 

\if2) = sma\tpstart) — COS a\ipi) . 

These three vectors are in the same plane. Hence, A -1 
maps them to three vectors in the same plane and pre- 
serves the angles between the three vectors (since A^ 1 is 
unitary). The first two vectors are mapped to \ipstart) and 
cos/3|V> S i art )+sin/3|V>2), respectively. Hence, A -1 must map 
the third vector (\tfi2)) to a vector in the same plane that has 
the same angles with these two vectors - that is, to l^fe) • □ 

We now define Aj as the algorithm that consists of the first 
steps of A', from Vi to Vj. Then, A' = A c - 



and, similarly to the previous case, we get \/3\ 



Lemma [3] follows by substituting j — c into 



Claim 2. 



Aj\lj)start) = 



cos ja\ip s tart) + sin ja\tpi) if j is odd 
cos j oi\ip start) -sinja|V>2) if j is even 



Proof: By induction. The base case, j = 1, immediately 
follows from the definition of \tpi)- 

The inductive case is slightly different for even j and for 
odd j. If j is even, Aj = A~ 1 TAj-i. From the inductive 
assumption and the definition of T, 

TAj-i\i> start) = cos(j - l)a|^>start) - sin(j - l)a\ipi). 

We can express this state as 

cos ja\<p start) — sin ja\tp2). 

Therefore, 

A~ 1 TAj-l\lp start) = COS ja\lp start) — SU1 j«|l/> 2 ). 

If j is odd, Aj = ATAj-i. Then, 

TAj-i\tpstart) = cos(j - l)a\tpstart) + sin(j - l)a\lp 2 ), 

AT Aj-\\i> start) = cos(j - l)a\<fi a tart) + sin(j - l)a\ip 2 ) 

= COS ja\^start) + SU1 jafyl). 

□ □ 

As a special case of Lemma [3] we obtain 

Corollary 1. // an algorithm A p-computes NE d with 
k queries, there is an algorithm A' that p' -computes NE d 
with 2k queries, for p — 2p 2 — 1 . 

Proof: We set c = 2 in Lemma[3] Then p' — cos 2(arccos p) = 
2p 2 - 1. □ 

3.3 Algorithm for NE d 

It is easy to see that 



Lemma 4. // there is an algorithm A that —1-computes 
NE l with k queries, there is an algorithm Ai that —1-computes 
NE a with k l queries for all I > 1. 



Proof: By induction. The algorithm A itself forms the base 
case, for 1 = 1. 

For the inductive case, we can obtain the function NE tl 
by taking the function NE tfl ~^ and, instead of each vari- 
able, substituting a function NE f on a block of 3* new 
variables. Therefore, we can take the algorithm Ai-i that 
computes NE t "~ 1 ' > with fc !_1 queries and, instead of each 
query, substitute the algorithm A that performs A\ip) = \tp) 
if NE l = for the corresponding group of 3* variables and 
A\ip) = -|^) if NE* = 1. 

In the resulting algorithm Ai, each of queries of Ai 
is replaced by a sequence of transformations that involves 



k queries. Therefore, the total number of queries for Ai is 
k l ~ 1 k = k 1 . □ 

Moreover, we get that Q E (NE tl ) < k l , since an algorithm 
that —1-computes NE tl can be transformed into an algo- 
rithm that 0-computes NE , with no increase in the num- 
ber of queries (Lemma[TJ and an algorithm for O-computing 
/ can be used to compute / exactly. Therefore, we have 



Corollary 2. If there is an algorithm A that —1-computes 
NE f with k queries, then Q E {NE d ) = 0{k d/t ). 



It remains to find a base algorithm that —1-computes NE 
with less than 3 d queries. We give two such constructions. 

Construction 1: 

1. NE°(xi) = xi can be — 1-computed with 1 query by 
just performing a query |1) — > ( — l) 1111 1 1) . 

2. Applying Lemma [2] with p = —1 gives that NE 1 can 
be — | computed with 2 queries. 

3. Applying Lemma [2] with p = — | gives that NE 2 can 
be p'-computed with 4 queries for p' — — 

4. Applying Lemma[T]gives that NE 2 can be 0-computed 
with 4 queries. 

5. Applying Corollary [1] with p — gives that NE 2 can 
be —1-computed with 8 queries. 



By Corollary H this means that Q E (NE d ) = 0(8 d/2 ) = 
0(2.828... d ). The exponent can be improved by using a more 
complicated base construction with a larger number of steps. 

Construction 2: 



1. Start with an algorithm that — 2||- C omputes NE 2 with 



gives that NE 3 



4 queries (from Construction 1). 

2. Applying Lemma [2] with p = - 
can be p'-computed with 8 queries for p' — 4 5 7 8 8 8 2 6 9 6 6 5 9 = 
0.123075.... 

3. Applying Corollary [T]gives that NE 3 can be p'-computed 
with 16 queries for p = -0.969704.... 

4. Applying Lemma [2] 3 times gives that NE 6 can be 
p'-computed with 128 queries for p = 0.223874.... 

5. Applying Corollary[T]gives that NE 6 can be p'-computed 
with 256 queries for p' = -0.8997602.... 

6. Applying Lemma [2] 2 times gives that NE 8 can be 
p'-computed with 1024 queries for p' = —0.15353.... 

7. Applying Lemma [T] gives that NE S can be also 0- 
computed with 1024 queries and applying Lemma [T] 
gives that NE S can be —1-computed with 2048 queries, 



This means that Q E {NE d ) = O(2048 d/S ) = 0(2.593... d ). 



3.4 Implications for communication complex- 

ity 

We consider the problem of computing a function 

f(yi, . . . ,VN,Zi, . . .,z N ) 

in the communication complexity setting in which one party 
(Alice) holds y 1 , . . . , j/jv and another party (Bob) holds Z\, 
. . ., zn- The task is to compute f(yi, ■ ■ ■ , zjv) with the min- 
imum communication. (For a review on quantum communi- 
cation complexity, see [9].) 

An exact quantum algorithm is an algorithm that communi- 
cates k qubits and, after communicating those k qubits, both 
parties output answers that are always equal to f(yi, ■ ■ ■ ,zjy). 
The communication complexity counterpart of Theorem [1] is 
the following theorem (due to Ronald de Wolf [33]): 

Theorem 2. J33jj There exists f(yi, . . . , yjv, zi, . . . , zjv) 
such that 

1. f can be computed exactly by a quantum protocol that 
communicates O(A r0 ' 8675 ■" log AT) quantum bits; 

2. Any deterministic protocol (or bounded-error proba- 
bilistic protocol or nondetermmistic protocol) comput- 
ing f communicates at least Q(N) bits. 

Proof: Let N = 3 d . We define 

f(yi,- ■ ■ , z N ) = NE d (y 1 A zi, . . . , yn A z N ). 

As shown in [8], existence of a T query quantum algorithm 
for g(x\, . . . , xn) implies the existence of a quantum proto- 
col for g(yi Azi, . . . , j/jv Azjv) that communicates 0(T log N) 
quantum bits. (Alice runs the query algorithm and imple- 
ments each query through O(logiV) quantum bits of com- 
munication with Bob.) This implies the first part of the 
theorem. 

The second part follows by a reduction from the set disjoint- 
ness problem [191 126j : DISJ(yi, . . . , zjv) = 1 if the sets {i : 
yi — 1} and {i : Zi = 1} are disjoint and DISJ(yi, . . . , zjv) = 
if these two sets are not disjoint. This is equivalent to 

DISJ(yi,. ..,Zff) = NOT OR(yi A zi, . . . ,y N A z N ). 



Theorem 3. [19, 261 Any deterministic protocol (or bounded- 
error probabilistic protocol or nondeterministic protocol) for 
computing DISJ requires communicating Q,(N) bits, even if 
it is promised that y%, . . . , yjv and z\, . . . , zn are such that 
there is at most one i : yi — Zi = 1. 

We have NE d (x 1 , . . . , x N ) = NOT OR(xt , . . . , x N ) for in- 
puts (xi, . . . ,xn) with at most one i : Xi — 1. Hence, The- 
orem [3] implies the second part of Theorem [2] □ 

4. CONCLUSION AND OPEN PROBLEMS 

We have shown that, for the iterated 3- bit non-equality func- 
tion, Q E (NE m ) = O(D(NE m ) 0m75 --). This is the first ex- 
ample of a gap between Qb(/) and D(f) that is more than 



a factor of 2. We think that there are more exact quantum 
algorithms that are waiting to be discovered. 

Some possible directions for future work are: 

1. Can we improve on Q E {NE d ) = 0(2.593... d ), either 
by using our methods or in some other way? 

2. HQ B (NE d ) is asymptotically larger than Q 2 (NE d ) = 
0(2. 121. .. d ), can we prove that? There are cases in 
which we can show lower bounds on Qe(J) that are 
asymptotically larger than Q2(f) [1] but the typical 
lower bound methods are based on polynomial degree 
deg(f) and, thus, are unlikely to apply to NE d for 
which deg(NE d ) = 2 d is smaller than both Q E (NE d ) 
and Q 2 (NE d ). 

3. How big can the gap between Q E (f) and D(f) be? 
Currently, the best lower bound is Q E (f) = fi(-D(/) 1/3 ) 

4. There are other examples of functions f d (xi, . . . , x n d) 
with deg(f) — 0(D(f) c ), c < 1 obtained by iterat- 
ing different basis functions f(x\, . . . ,x n ) 20, 1 Can 
we show that Q E {f d ) = 0(D(f d ) c ), c < 1 for those 
functions as well? 

5. Computer experiments 22 show that exact quantum 
algorithms can be quite common: Q E (f) < D(f) for 
many functions / on a small number of variables (and, 
in most cases, an algorithm that uses Q E (f) queries 
cannot be produced simply by using the PARITY al- 
gorithm). 

Can we develop a general framework that will be able 
to produce exact algorithms for a variety of functions 
f(xi, . . .,x N )7 

6. What is Q E (f) for a random n- variable Boolean func- 
tion f(xi, . . . , xjv)? We know that Q E (f) < N (triv- 
ially) and Q E (f) > Q 2 (f) = f + o(N) 3 - Which of 
these two bounds is optimal? 

A similar question was recently resolved for Q 2 {f), by 
showing that Q 2 (f) = -f- + o(N) for a random /, with 
a high probability |3]. 
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